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\'Jhen  it  works  well,  factor  analysis  helps  the  investigator  make  sense  of 
large  bodies  of  intertwined  data.  V.'hen  it  works  unusually  well,  it  also  points 
to  interesting  relationships  that  might  not  have  been  obvious  from  examination 
of  the  input  data  alone. 

In  marketing  research,  factor  analysis  can  be  useful  in  four  ways.   First, 
(but  not  necessarily  most  important)  it  can  point  out  the  "latent"  --  that  is, 
"underlying,"  not  directly  observed  --  dimensions  that  account  for  the  relation- 
ships among  product  preferences  or  other  kinds  of  ratings  obtained  in  market 
studies.   For  example,  suppose  one  has  no  idea  of  the  dimensions  consumers  use 
when  choosing  among  types  of  liquors.   If  respondents  are  asked  to  rate  several 
types  of  liquors  (Scotch,  gin,  rum,  bourbon,  vodka,  etc.)  according  to  preference, 
a  factor  analysis  may  reveal  some  salient  characteristics  of  liquors  that  under- 
lie the  relative  preferences.   These  characteristics  might  turn  out  to  be  sensory 
attributes  such  as  degree  of  sweetness,  "bouquet,"  or  intoxicative  potency.  They 
miglit  turn  out  to  be  intangible  attributes  such  as  stereotyped  product  images. 
Or  they  might  turn  out  to  be  some  mixture  of  both.   The  results,  whatever  they 
are,  will  not  dictate  infallible  marketing  policy.   But  they  will  probably  be 
of  some  help  to  tlie  marketer  who  is  trying  to  organize  his  thinking. 

The  second  way  factor  analysis  can  be  helpful  is  by  pointing  out  relation- 
ships among  observed  values  that  v?ere  there  all  the  time  but  not  easy  to  see. 
For  example,  a  factor  analysis  of  cosmatic  use  once  suggested  that  hair  spray  is 
more  closely  associated  v;ith  face  cosmetics,  such  as  eye  shadovj  and  lipstick, 
than  with  other  products  women  use  specifically  for  their  hair.   It  also  showed 
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that  hair  spray,  eye  shadow  and  lipstick  belong  in  a  group  of  purchases  and 
activities  that  include  the  number  of  movies  attended  in  the  past  month.   This 
grouping  pointed  to  a  dimension  of  consumer  behavior  that  seemed  to  be  worth 
intensive  follow-up  research.   It  also  suggested  some  iiinnediate  applications: 
A  lipstick  instead  of  a  coinb  as  a  hair  spray  premium;  movie  themes,  or  movie 
related  prizes,  as  sales  promotions  for  face  cosmetici.   In  the  long  run^the 
latent  dimension  and  the  unforeseen  relationship  are  perhaps  of  more  value  than 
the  specific  immediate  application,  but  on  an  exceptionally  good  day  one  can  have 
all  three. 

Third,  factor  analysis  is  useful  vhen  data  must  be  condensed  and  simplified. 
Suppose,  for  example,  that  some  television  comjnercials  have  been  rated  on  50  or 
60  rating  scales,  and  the  problem  is  to  present  these  ratings  to  a  decision 
maker  who  docs  not  have  three  weeks  to  study  them.   If  several  of  the  scales,  or 
several  groups  of  scales,  are  heavily  correlated  with  one  another  because  they 
are  in  fact  redundant,  factor  analysis  will  suimnarize  the  information  in  them  and 
make  the  whole  set  easier  to  handle.   It  is  this  use  of  factor  analysis  that  coi-ncs 
in  handiest  when  the  marketing  vice-president  says,  "Don't  give  me  ten  pages  of 
numbers.   Just  give  mc  the  main  results." 

Finally,  and  related  to  the  third  use,  factor  analysis  can  be  employed  as 
one  step  in  empirical  clustering  of  products,  media,  stimuli  or  people.   In  the 
previous  example,  for  instance,  an  additional  outcome  might  be  the  clustering 
of  a  large  number  of  commercials  into  a  smaller  number  of  useful,  meaningful  and 
possibly  nonobvious  types. 


Wh at  Ir.  Factor  Analysis? 


v 


Factor  annlysis  is  a  multivariate  statistical  technique  that  addresses 
itself  to  the  study  of  interrelationships  among  a  total  set  of  observed  variables. 
Unlike  multiple  regression  in  which  one  variable  is  explicitly  considered  the 
criterion  (dependent)  variable  and  all  others  as  tlie  predictor  (independent) 
variables,  all  of  the  variables  in  factor  analysis  are  considered  simultaneously. 
In  a  sense,  each  of  the  observed  variables  is  considered  as  a  dependent  variable 
that  is  a  function  of  some  underlying,  latent  and  hypothetical  set 'of  factors. 
Conversely,  one  can  look  at  each  factor  as  a  dependent  variable  that  is  a  function 
of  the  observed  variables. 

Several  methods  of  factor  analysis  are  available,  and  these  several  ncthods 
do  not  necessarily  give  tlie  same  results.   In  this  sense,  factor  analysis  is 
indeed  a  set  of  techniques  ratJier  than  a  single  unique  method. 

Vocabulary 

Factor  analysis  has  some  specialized  concepts  and  terminology.  A  factor 
is  an  underlying  dimension  that  "accounts  for"  several  observed  variables.   For 
example,  consider  the  follov^ing  tabulation  of  all  the  possible  intercorrelctions 
among  nine  variables: 
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Varlablc 

1.  Author,  Fiction 

2.  Author,  Children's  Books 

3.  Newspaper  Reporter 
A.  Computer  Programner 

5.  Bookkeeper 

6.  College  Math  Teacher 

7.  Nurse 

8.  Doctor 

9.  Lab  Technician 


(dec 

TADLE  1 
imals  omitted) 

2 

Correlat 

ions 

Ar.onc.  Vari 

ables 

1 

3 

4 

5 

08 

6    7 

25   05 

8 

28 

9 

76 

48 

20 

18 

76 

47 

19 

07 

25   10 

30 

21 

48 

47 



22 

13 

22 

09 

31 
20 

26 

20 

19 

22 

42 

53 

00 

33 

08 

07 

13 

42 

36 

-01 

09 

18 

25 

25 
10 

22 

53 

36 

08 

31 

33 

05 

09 

00 

-01 

08 

45 

34 

28 

30 

31   20 

09 

31 

45 

48 

18 

21 

26 

33 

IB 

33 

34 

48 

In  this  illustration,  the  variables  are  ratings  of  nine  occupations  by  a 
large  sample   (850)  of  hoinomakers.  On  a  five  point  scale,  each  respondent  rated 
eath  occupation  in  terms  of  how  well  she  thought  she  would  do  in  it  if  she  had 
the  opportunity  to  build  a  career  in  that  field.   The  entry  76  in  Row  1,  column 
2  means  that  homemakcrs  who  thought  they  would  do  well  in  occupation  1  (author, 
fiction)  also  tended  to  say  tjicy  would  do  well  in  occupation  2  (author,  children's 
books).   The  entry  20  in  Row  1,  coIukji  4  neann  that  homemakers  who  thought  they 
would  do  well  as  an  author  of  fiction  had  a  much  weaker  tendency  to  say  they 
would  do  well  aa  a  computer  programmer  (occupation  4). 

It  is  obvious  from  inspection  alone  that  variables  1,  2  and  3  have  some- 
thing in  ccnznon   that  they  do  not  chare  with  the  other  variables.   It  is  also 
obvious  that  variables  4,  5  and  6  form  a  second  group,  and  that  7,  8  and  9  fonn 
a  third.   This  sort  of  pattern  in  raw  data  leads  to  the  inference  that  one  under- 
lying diiaenslon  --  one  factor  --  "accounts  for"  variables  1,  2  and  3;  that  a 
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second  factor  "nccounts  for"  4,  5  and  6;  and  that  a  third  "accounts  for"  7,  8 
ana  9. 

Real  data  do  not  usually  come  this  neatly  dressed.  The  patterns  in  real 
data  arc  not  usually  so  striking  as  in  this  example  contrived  for  clarity  of  , 
exposition.   The  data  are  real,  but  they  were  especially  selected  so  as  to  mrike 
the  relationship  between  the  raw  data  and  the  factor  solution  intuitively  ob- 
vious. A  more  realistic  example  v/ill  be  given  later. 

A  factor  analysis  performed  by  one  of  the  most  commonly  us.ed  methods 
(principal  components  analysis  with  varimax  rotation)  produced  the  follov7ing 
solution,   (For  computational  techniques, , see  Harman,  1967.) 


Variable 

1.  Author,  fiction 

2.  Author,  Childrens  Books 

3.  Nevjspaper  reporter 

4.  Computer  programmer 

5.  Bookkeeper 

6.  Coll  math  teacher 

7 .  Nurse 

8.  Doctor 

9.  Lab  technician 


TABLE 

2 

• 

(decimals  omitted) 

Loadinf,s  on  Factors 

h^ 

A 

■^0- 

B       C 
08       -05 

(communal ity) 
82 

89 

07       -10 

80 

' 

69 

15       -17 

53 

14 

81 

-08 

69 

-01 

76 

03 

57 

20 

73 

-20 

62 

-02 

-13 

-83 

70 

28       14 

-77 

69 

13 

36 

-68 

61  . 

Sura  of  squares 
(eigenvalue) 


223 


197 


183 


603 
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The  entries  in  Coluirins  A,  B  and  C  are  the  factor  loadiny.s.  They  show 
how/ closely  the  9  variables  are  related  to  each  of  the  three  underlying  factors, 
and  they  are  the  key  to  understanding  what  the  factors  mean.   For  instance,  the 
heavy  loadings  of  Autlior,  Fiction;  Author,  Children's  books;  and  Newspaper  re- 
porter on  Factor  A  indicate  that  it  represents  an  underlying  general  interest 
in  things  literary  and  verbal.   Factor  B  evidently  represents  interest  in  figures. 
And  Factor  C  represents  interest  in  the  medical  field.   The  minus  sign  in  front 
of  the  entries  in  Column  C  do  not  affect  the  interpretation  of  the  factor.   In 
fact,  as  a  matter  of  convenience,  it  is  cuistomary  to  "reflect"  (change  all)  the 
signs  of  a  factor  on  vjhich  the  highest  loadings  are  negative.  As  long  as  all 
the  signs  in  a  column  are  changed,  it  is  the  absolute  siiie  of  the  loadings, 

rather  than  the  signs,  that  count. 

2  ' 

The  h  (comnunality)  column  shovjs  how  much  of  each  variable  is  "accounted 

for"  by  the  three  underlying  factors  taken  together.   A  large  communality  figure 
means  that  not  much  of  the  variable  is  left  over  after  whatever  the  factors  re- 
present is  taken  into  consideration.  A  small  comniunality  figure  means  that  the 
factors  taken  together  do  not  account  for  much  of  whatever  the  variable  is  all 
about. 

The  sum  of  squares,  or  cif;Gnvaluc ,  indicates  the  relative  importance  of 
each  factor  in  accounting  for  the  particular  set  of  variables  being  analyzed.   It 
is  sometimes  implied,  mistakenly,  that  the  eigenvalue  indicates  importance  in 
some  more  abstract  sense  --  e.g. ,  the  relative  importance  of  each  factor  in  in- 
fluencing choice,  or  the  relative  dominance  of  each  factor  in  the  respondent's 
world.  Not  so.   The  eigenvalue  refers  to  one  particular  set  of  variables  (and 
one  specific  solution  method)  and  can  easily  be  changed  by  changing  either  the 
method  or  the  mix  of  variables. 


I        The  entry  at  the  far  right  of  the  eigenvalue  row  (603  in  this  case)  is 
the  total  sum  of  squares.   Dividing  it  by  the  number  of  variablep  (6,03/9  -  about 
2/3)  provides  an  index  of  hov?  veil  this  particular  solution  accounts  for  vhat  all 
the  variables  taken  together  represent.   If  the  variables  are  all  very  different 
from  each  other,  this  index  will  be  low.   If  the  variables  fall  into  one  or  more 
highly  redundant  groups,  and  if  the  extracted  factors  account  for  all  the  groups, 
the  index  will  approach  unity. 

Decisions.  Decisions 

As  noted  earlier,  the  name  factor  analysis  is  applied  to  a  variety  of 
procedures.  ^-Jhile  the  natheoatical  analysis  is  pretty  much  the  same,  the  pro- 
cedures provide  a  large  number  of  options  to  the  analyst  to  suit  the  method  to 
his  -purposes.   Unfortunately,  the  resulto  from  these  options  vary  considerably, 
and  there  is  no  sure  guide  as  to  vjhich  options  are  "right."  The  trick  is  to  pick 
the  combination  of  options  that  best  does  the  Job  at  hand.  Among  the  options  on 
which  the  analyst  must  decide  arc  these. 

Correlation,  Covr.risnee  or  Cro.':r.-Pro.''-riCts  >tatrlx?  Although  it  is  most 
common  to  factor  analyze  a  matrix  of  correlations,  as  in  the  illustration  above, 
that  choice  is  not  mandatory.  Since  a  correlation  coefficient  is  derived  from 
fltandard  scores  in  wliich  the  aversges  of  all  variables  arc  set  equal  to  zero  and 
the  variances  equal  to  one,  factor  analysis  of  correlations  loses  two  of  the 
three  types  of  information  contained  in  a  data  matrix,  naroely  the  levels  and 
disperoiono  of  variables.  In  seme  cases,  it  may  be  desirable  to  retain  one  or 
both  of  these  types  of  information  in  analyzing  data.  In  such  cases,  a  covp.rienc« 


matrix  (only  moans  sot  equal  to  zero,  but  variance  not  standardized)  or   croso- 
productB  (unstandardlzed  data  matrix)  may  be  appropriate  as  Input  to  factor 
analysis. 

Generally,  If  the  unite  of  meaourcsant  ere  disparate  among  observed 
variables  (as  for  exaaple  betwacn  Inccne  and  education),  it  is  advisable  to 
standardise  the  data  and  hence  use  &   i-natrix  of  correlations.   By  the  same  token,. 
if  the  units  of  tseasurcinent  are  identical  or  very  siiuilar  across  variables,  and 
If  individual  differences  are  expected,  It  may  be  better  to  factor  analyze  a 
matrix  of  covarianccs  or  cross-products  co  ao  not  to  lose  the  infomation 
contained  in  the  means  and  standard  daviations.   For  example,  in  the  illustration 
just  given,   if  some  of  the  occupations  had  received  very  high  average  ratings, 
while  some  received  relatively  low  average  ratings,  or  if  there  had  been  much 
more  disagrccnant  about  some  occupations  than  others,  our  yoe  of  a  correlation 
matrix  ao  a  otarting  point  vould  have  erased  this  possibly  useful  information. 
Moral:   the  moat  widely  used  factor  analytic  procedures  ignore  level  differences 
and  dioperoion  differcncos  among  input  variables.   If  these  differences  are 
important  to  the  Investigator,  alccrn&tivo  precodures  are  available  and  must  be 
followed.   For  further  inforraation  on  this  point  see  Nunnally, (1967). 

What  Goas  Into  th^  Dtat>onal  of  tho.  Corralntlon  M-itrix?  If  the  researcher 
chooses  the  option  of  using  a  matrix  of  correlations  as  the  input  to  factor 
analysis,  one  more  option  on  which  he  raust  decide  is:  What  value  should  be  put 
in  the  diagonal  of  the  matrix  --  the  diagonal  that  represents  correlation  of  a 


•9- 


variablc  with  itself?  The  variouo  options  and  their  coriijcquencen  are  disciit;"ed 
in  Harman  (1967), 

For  many  marketing  research  purposes,  the  best  choice  seeras  to  be  1  (unity). 
This  choice  usually  produces  clear,  intuitively  appcalinjj  solutions,  and  it  is 
now  so  much  in  vogue  that  it  is  the  standard  option  employed  in  many  of  the  r;,ost 
widely  used  canned  computer  programs. 

Rotation:   Another  substantive  option  is  tlie  derivation  of  "new"  factors 
from  the  initial  results  by  the  methods  of  rotation.  As  a  very  rough  analogy, 
rotation  is  soi-aathing  like  stainins  a  microscope  slide.   Just  as  different  stains 
reveal  different  structures  in  tlie  tissue,  different  rotationr.  reveal  different 
structures  in  the  data,  even  thouj;);  in  both  cases  all  the  structures  are' always 
actually  there. 

Different  rotations  give  entirely  different  appearing  results.   From  a 
statistical  point  of  view,  all  results  arc  equal,  none  superior  or  inferior  to 
others;  but  from  the  point  of  view  of  making  sense  of  the  results  of  factor 
analysis,  selecting  the  right  rotation  is  extremely  important. 

■  An  illustration  of  the  effects  of  rotation  is  sho\-m  in  Table  3.   The  "un- 
rotated  factor  loadings  matrix"--  shown  here. for  the  nine  variable  occupation 
problem  --  is  the  first  output  of  any  factor  analysis.   A  glance  at  the  unrotated 
factor  loadings  will  show  that  the  pattern  is  not  at  all  clear.   In  the  matrix  to 
the  right,  the  factors  have  been  rotated  by  the  varimax  method,  a  procedure  that 
produces,  within  each  factor,  as  many  high  loadings  and  as  many  low  loadings  as 
possible. 
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TABLE  3 

COMPAP.ISON  OF   IT:,-ROTATI:o  and  ROTATf-D   FACTOR   SCHITION'S 


Load 

in^s  on 
factor 

unrot 

ated 

Load 

ingG  on 
factors 

rotated 

A 

B 

C 

h^ 

A 

B 

C 

h^ 

68 

-52 

31 

82 

90 

08 

-05 

82 

69 

-51 

26 

80 

89 

07 

-01 

80 

63 

-32 

16 

53 

69 

15 

-17 

53 

58 

53 

27 

69 

14 

81 

-08 

69 

38 

59 

29 

57 

-02 

•  76 

03 

57 

63 

44 

16 

62 

20 

73 

-20 

62 

33 

-04 

-77 

70 

-02 

-13 

-83 

70 

65. 

-04 

-52 

69 

28 

13 

-77 

69 

62 

22 

-^'.2 

61 

13 

36 

-68 

61 

(decimals   omitted) 

Lo 

Variable 

1.  Author,   Fiction 

2.  Author,   Children's  Books 

3.  Newspaper  reporter 

4.  Co:iiputer  prograraaer 

5 .  Bookkeeper 

6.  Coll  math   teacher 

7 .  Nurse 

8.  Doctor 

9.  Lab   teclmician 

Sum  of  squares  312    151     139    602  223    197    183    602 

(eigenvalue) 

Note  that  the  concnunality  for  each  variable  rersains  the  same  regardless  of 
rotation.   This  is  another  v.-ay  of  saying  tlint  the  rotated  factors  taken  together 
account  for  exactly  the  sar.ie  air.ount  of  each  variable  as  the  unrotatcd  factors  do. 
It  is  just  that  the  "v/eight?'of  each  Vt'.ririblc  on  each  factor  is  nov;  redistributed. 

Tlie  eigenvalue;-  have  changed,  since  they  are  the  surr.s  of  the  squares  of  the 
loadings.   In  the  unrotatcd  i/.atrix,  one  factor  (Factor  A)  dominates  the  picture. 
In  the  rotated  matri::,  tJie  leadings  have  shifted  so  tliat  the  eigenvalues  are  nov; 
more  nearly  equal.   Had  the  input  variables  been  different  --  for  instance,  had 
variables  4,  5  and  6  also  been  literary  occupations  --  tlie  relative  sizes  of  the 
eigenvalues  in  the  rotated  solution  would  liave  shifted  to  reflect  the  greater 
weight  of  literary  activities  in  ti'.o  altered  variable  mix. 
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When  to  Stop  Frctorlnr;.   When  a  large  unorganisccd  set  of  variables  is 
factored,  as  ts  often  the  case  in  inarkcting  recearch,  the  analysis  will  extroct 
the  larjjest  and  most  interesting  combinations  of  variables  first  and  then  proceed 
to  smaller  combinations.   For  exnr.ple,  in  an  analysis  of  grocery  product  use,  the 
first  factor  extracted  v;as  a  group  of  food  and  laundry  products  that  appeared 
upon  examination  to  be  products  that  are  consumed  in  quantity  by  large,  middle 
income  families.   The  next  factor  consisted  of  products  used  for  wrapping  and  pre- 
serving food.   The  next  was  a  group  of  foods  used  heavily  by  relatively  lew  incone 
families,  and  fhe  next  was  a  group  of  products  that  are  supposed  to  germproof  and 
deodorize  the  home.   All  of  these  product  groups  were  interesting  and  meanin^;ful, 
but  as  the  analysis  proceeded  the  groups  became  smaller  and  less  understandable, 
until  finally  the  "groups"  consisted  of  only  one  of  the  remaining  products  each. 

Carrying  an  analysis  too  far  has  two  penalties.   It  is  exceedingly  v-aste- 
ful  of  computer  time,  and  it  obscures  the  meaning  of  the  findings  because  it 
affects  tlie  rotation  adversely,   V.'hen  many  factors  arc  involved  in  varima:: 
rotation,  the  tendency  is  to  produce  rotated  factors  that  have  very  high  loadings 
on  a  very  few  variables.  This  produces  fragments.   On  the  other  hand,  if  very 
fev;  factor.s  are  rotated,  tlie  tendency  is  to  have  moderate  to  low  loadings  on 
quite  a  few  variables  so  that  no  meaningful  interpretation  is  possible.   To 
return  to  the  microscope  analogy,  choosing  the  number  of  factors  to  be  extracted 
is  something  like  focusing.   Too  high  or  too  low  an  adjustment  will  obscure  a 
structure  that  is  obvious  v:hen  the  adjustment  is  just  right. 
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Four  stopping  criteria  may  be  employed.   When  the   analyst  already  knows 
how  many  factors  he  wants  to  get  out,  he  can  have  the  analysis  stopped  after 
the  desired  number  of  factors  have  been  extracted.   Iii  raarkcting.  -csearch  this 
situation  is  rare.   Second,  if  he  has  a  clear  idea  in  advance  about  hov;  much  of 
each  variable  the  factors  can  explain  (also  a  rare  privilege  in  mai-Ucting  research), 
he  can  stop  wlicn  that  criterion  is  reached.   Most  coir.rnonly,  however,  if  he  does 
not  know  very  much  about  his  data  to  begin  with,  he  will  want  to  keep  factoring 
until  factors  get  small  and  meaningless.  "   . 

T\oo  criteria  for  "small  and  meaningless"  are  often  used.   If^after  a 
certain  number  of  factors  have  been  extracted* the  eigenvalue  of  the  next  factor 
drops  to  a  sharply  lower  level,  the.    factor  with  the  low  eigenvalue  may  be  dis- 
carded and  factoring  may  be  stopped.   Second,  when  all  factors  vjIio-sc  eigenvalues 
are  greater  than  unity  have  been  extracted,  the  factoring  may  be  stopped. 

The  reader  who  lias  corv.e    this  far  vjill  knov.'  \vhy  critics  of  factor  analysis 
have  insisted  that  it  is  a  branch  of  theology  rather  than  a  properly  objective 
scientific  I'letliod.   VJhat  to  correlate,  tlje  type  of  data  input,  the  entries  in 
the  diagonal,  the  type  of  rotation  used  and  tlic  stopping  criterion  arc  all 
decisions  the  analyst  inusr  make.   The  decisions  create  numerous  combinations, 
and  are  not  easy  or  automatic.   Along  with  the  data,  they  determine  the  results. 

A  Realistic  Exa;;iplc.   As  noted  earlier,  the  nine-variable  example  that 
employed  occupation  ratings  \«s  deliberately  simplified  to  make  an  intuitive 
understanding  of  the  terms  and  procedures  easier,   A  more  realistic  example  will 
now  be  presented  --  more  realistic  in  the  sense  that  it  eiv.ploys  more  variables, 
and  more  realistic  in  the  sense  that  the  pattern  in  the  input  variables  is  not 
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so  self-evident  as  tc  vas  in  tlie  case  of  the  occupation  ratings.   In  this 

example  the  data  consist  of  correlations  in  reported  reading  of  30  magazines 

by  a  large  sample  (930)  of  adult  nales.   The  fact  that  the  valuable  Business  Week 

correlates  19  with  Life  and  31  vith  Kev;  Yorlier  says  that  Business  Week  and 

Kcw  Yorker  have  wore  in  cor.^mon  than  do  nuslncss  ^.'eek  and  Life ,  at  least  in 

this  sample.   Note  that  tlie  correlations  are  not  very  distinct  from  one  anoth.or, 

and  therefore,  cy^'balling  the  matrix  docs  not  show  any  obvious  simple  groups. 

(Insert  Table  4  here)  (p.  14) 

When  the  rolation.'jhips  are  as  complex  and   as   numerous  as  these  (and  this  is  a 
simple  and  small  matrix  compared  to  some  tliat  are  generated  in  inarl;eting  re- 
search), factor  analysis  will  help  sort  them  out. 

This  matrix  of  correlations  was  factor  analyi;ed  by  the  principal  coi.^pononts 
method  in  whicli  ones  v;ere  placed  in  the  diaj^onai. 

The  first  output  of  the  factor  analysis  looked  like  this: 

(Insert  Table  5  here)  (p.  15) 

The  columns  represent  the  factors.   The  items  being  analy::ed  are  listed 
douT)  the  side.   The  entries  in  tl-.e  columns  are  the  factor  loadings,  the  cor- 
relation between  eac)i  item  and  the  factor.   This  is  an  unrotaLed  matrix  and, 
as  usual,  its  meaning  is  not  clear. 

The  matrix  in  Table  6  is  the  product  of  a  variraax  rotation.   The  rotation 
has  clarified  the  factors,  and  they  can  now  be  interpreted  as  follows: 

(Insert  Table  6  here.)  (p.  16) 
Factor  1  has  high  loadings  on  Car  and  Driver,  Road  and  Track,  Motor  Trend 
^■^^  Hot  Rod.   This  means  that  respondents  who  say  they  read  Car  and  Driver 
also  tend  to  say  tlicy  read  the  other  m.agazines  that  load  high  on  Factor  1.   In 
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PRIK'CIPAT.  COMrONKWTS   FACTOll   LOADXKOS 


(dccima 

Is  oniiLti 

L>ci) 

'■ r- — 

Variable 

Var. 

No. 

Factor 

, 

Dt>scr  <pt:  on 

1 

2 

3 

i-t 

5 

6 

7 

8 

9 

10 

Corrmunallty 

Bui.  Week 

1 

35 

43 

-12 

04 

06 

-28 

-33 

17 

15 

00 

56 

Life 

2 

AS 

26 

07 

-27 

16 

33 

-03 

-05 

21 

-28 

61 

Kew  Vorkcr 

3 

30 

51 

-21 

19 

-19 

-13 

04 

17 

00 

-11 

52 

Time 

4 

37 

45 

-04 

-12 

05 

-08 

-32 

-20 

05 

-21 

55 

Newsweek 

5 

43 

38 

05 

-03 

12 

05 

-10 

33 

11 

32 

58 

U.S.  News  & 

World  Rpt. 

6 

24 

37 

11 

15 

23 

00 

-12 

14 

18 

47 

57 

S.it.  Review 

7 

25 

36 

-10 

32 

-03 

04 

41 

32 

-01 

-20 

61 

Look 

8 

45 

18 

15 

-20 

16 

41 

18 

-05 

22 

-02 

58 

Sat.  Ev.  Post 

9 

39 

20 

20 

-23 

12 

38 

23 

-16 

14 

-21 

58  • 

Forbes 

10 

20 

36 

-07 

08 

01 

-31 

-22 

-37 

-05 

04 

47 

Argosy 

11 

45 

-13 

19 

-18 

-07 

23 

-19 

10 

-49 

05 

64 

Atl.  Mthly. 

12 

21 

35 

-10 

23 

-24 

18 

35 

-25 

-14 

36 

65 

Car  6.  D'iver 

13 

37 

-38 

-46 

17 

04 

19 

-21 

-05 

17 

01 

63 

Fid.  &  Stream 

U 

50 

-21 

46 

-02 

-43 

-14 

-04 

00 

18 

04 

75 

Farm  Jrnl-. 

15 

-02 

-09 

49 

63 

32 

09 

-11 

-06 

-01 

-08 

77 

For  tune 

16 

29 

44 

-17 

14 

-12 

-19 

-25 

-32 

-15 

-16 

58 

Harpers 

17 

17 

35 

-21 

29 

-32 

22 

28 

-27 

-13 

19 

63 

Kech.  lUus. 

18 

57 

-31 

-09 

-06 

24 

-29 

18 

-16 

-04 

02 

63 

Pop.  Mech. 

19 

57 

-30 

-03 

-04 

26 

-40 

31 

-08 

-04 

06 

75 

Pop.  Sci. 

20 

50 

-23 

-10 

-10 

36 

-36 

33 

-01 

-05 

-01 

69 

Outdoor  Life 

21 

43 

-24 

43 

-00 

-44 

-19 

01 

01 

13 

01 

67 

Prog.  Parmer 

22 

-03 

-08 

30 

44 

13 

00 

07 

-24 

04 

-30 

46 

Reader's  Dg3t. 

23 

21 

25 

29 

-16 

40 

02 

-09 

02 

-23 

13 

45 

Road  ft   Track 

24 

42 

-37 

-41 

19 

-09 

21 

-18 

02 

11 

-10 

63 

Sci.  Atner. 

25 

26 

18 

-16 

22 

-10 

-12 

11 

45 

-22 

-35 

58 

Succ.  Frmng. 

26 

-01 

-11 

48 

54 

25 

12 

-13 

;6 

-10 

04 

63 

Sports  Afield 

27 

44 

-23 

43 

05 

-39 

-10 

-01 

05 

22 

03 

64 

True 

28 

45 

-11 

24 

-22 

-13 

15 

-12 

03 

-52 

-05 

65 

Hot  Rod 

29 

36 

-41 

36 

15 

04 

18 

-12 

00 

-05 

09 

51 

Motor  Trend 

30 

48 

-42 

-37 

21 

04 

16 

-13 

03 

-01 

06 

65 

Xlgenvslues 


413     292     225     169      150      144      125      106     101      100 
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variable  Var. 

Description  No. 
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TA)-.LE   6 

ROTATED  1?ACX0RJVa'RI>: 
(decipials   oinitted) 

Factor 


1 


8 


9'   10 


Bus.  Week  1 

Life  2 

New  Yorker  3 

Time  A 

Newsweek  5 

U.S.  Neva  & 

World  Rpt.  6 

Sat.  Review  7 

Look  8 

Sat.  Ev.  Post  9 

Forbes  10 

Argosy  11 

Atl.  Mthly.  12 

Car  i  Driver  13 

Fid.  U   Stream  U 

Parra  Jrnl.  15 

Fortune  16 

Harpers  17 

Hech.  Illus.  18 

Pop.  Mech.  19 

Pop.  Scl.  20 

Outdoor  Life  21 

Prog.  Parraer  22 

Reader's  Dgst.  23 

Road  (x   Trcck  24 

Scl.  Aner.  25 

Succ.  Frnng.  26 

Sporto  Afield  27 

Trua  28 

Hot  Rod  29 

Motor  Trend  30 


05  51  06  -lU  02  03  -15  26  -06  li 

07  20  04  -06  li  01  -01  08  09  iw 

00  37  OU  -11  05  -05  21  53  -07  19 

-00  65  02  -06  31  -01  -07  07  08  12 

03  10  09  -09  20  04  05  20  U  68 


03 

12 

00 

12 

07 

03 

14 

-02 

-06 

72 

04 

-03 

-03 

09 

17 

09 

26 

69 

-11 

10 

06 

-05 

05 

-01 

71 

09 

13 

00 

08 

20 

05 

03 

07 

03 

74 

08 

11 

03 

10 

-01 

07 

64 

-00 

02 

-06 

12 

15 

-07 

-03 

09 

17 

00 

17 

-01 

12 

04 

01 

01 

75 

07 

02 

08 

02 

-02 

07 

02 

78 

07 

03 

13 

78 

04 

01 

-04 

04 

10 

-02 

-04 

-06 

01 

05 

04 

84 

04 

09 

10 

-00 

-02 

14 

04 

01 

-03 

05 

87 

-02 

-02 

-04 

-02 

-00 

09 

03 

72 

-02 

-00 

01 

-01 

19 

14 

07 

-03 

08 

15 

-01 

-03 

07 

-09 

75 

15 

-02 

-01 

26 

10 

15 

-01 

09 

72 

-01 

-03 

09 

-01 

16 

03 

19 

00 

03 

82 

00 

04 

05 

05 

12 

00 

03 

-04 

05 

81 

-07 

10 

04 

05 

02 

01 

80 

04 

01 

13 

01 

01 

13 

-02 

04 

07 

07 

61 

07 

05 

02 

03 

-13 

-25 

■24 

12 

-14 

15 

21 

18 

-07 

-08 

36 

34 

77 

03 

10 

-03 

06 

04 

-01 

10 

01 

-07 

07 

08 

01 

-00 

-05 

06 

-04 

74 

14 

-01 

01 

-11 

07 

75 

-08 

-06 

-03 

-01 

12 

17 

08 

-03 

78 

10 

03 

03 

-00 

01 

07 

06 

06 

05 

23 

-03 

13 

10 

02 

04 

75 

-06 

68 

-07 

-01 

-01 

-04 

16 

05 

-03 

13 

01 

77 

-03 

05 

03 

01 

20 

04 

03 

12 

04 

Conmunnl  ity 


56 
61 
52 
55 
59 

57 
61 
58 
58 
47 
64 
65 
63 
75 
77 
58 
63 
63 
75 
69 
67 
47 
45 
63 
58 

63 

64 

65 
51 
65 


Sums  of  Squares 


250  190  215   178  188  206   145  152  147   154 
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*The  sij-ns  in  variables  6  and  i)  have  been  reflected  ns  an   aid  to  interpretation;. 
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other  words,  these  four  mugazlnes  form  a   group  based  on  sorae  degree  of  comni.  a 
audience. 

Factor  2  has  relatively  high  loadings  on  Fortune,  Forbes ,  Time  and 
Bu  sine  so  Wo.rk.   Again,  the  interpretation  is  that  magazines  in  this  group  have 
more  cudience  overlap  with  each  other  than  with  magazines  that  loadhighly 
other  factors.   The  inference  is  that  they,  like  the  first  group,  represent 
some  conaion  core  of  Intereats. 

Factor  3  has  high  loadings  on  Field  and  Stream,  Outdoor  Life  and  Sports 
Afield.   Factor  4  has  high  loadings  on  FanM  Journal  ,  £ucc!:>-3n£u1  Fannint;  and 
Progressiva  Ffa-mor.   Tlius,  one  can  proceed  through  the  whole  matrix,  factor  by 
factor,  looking  for  high  loadings  to  determine  what  the  various  factors  "mciiu." 

The  Meaniix^'.  of  "Loading','.'   Kote  that  all  the  magazincG  have  a  loading  of 
some  size  on  every  frctor,  c-nd   that  for  any  one  factor  the  loadings  of  a  few 
magazines  arc  If.rge  -.bile  the  loadings  of  the  other  tcagaxines  are  small.   This 
rsGul.t  is  exactly  what  r  vari-aax  rotr.tioa  ic  intended  to  ach'.eve.   Other  rotation 
systems,  designed  to  achieve  other  outcomoi;,  would  h.nve  produced  a  different 
configuration  from  the  ^..ma  ;.nrotated  matrix.   The  loadings  portray  the  degree 
to  V/hich  tliC  indivjducl  if',.  3  (niaga;:iucs  in  t:hic  case)  represent  the  factor  as 
a  vjhole.   Thus,  Pnrtune  and  Forh^  c-crve  better  than  Bu.sine£.£  V?eek  as  repre- 
sentatives of  the  underlying,  latent  dimanaion  signified  by  Factor  2.   If  the 
high  loaded  items. on  a  factor  are  thought  of  as  a  group,  the  highest  loaded 
itcmii  are  the  best  instances  of  whatever  it  is  that  holds  the  group  together. 
In  the  present  example,  the  cement  that  glues  the  groups  is  presumably  editorial 
content  vriYich  makes  megazincs  within  groups  appealing  to  somewhat  the  same  group 
of  readers. 
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Note  that  Reader's  Dlcrcr. t  does  not  IiEve  a  high  loading  on  any  of  the 
factors.   Instead  its  positive  loadings  are  divided  cnong  the  news   group 
(Factor  10)  beat  typified  by  U.S.  Nevn  aiid  I'ew:;\-?f-ek,  a  "general  reading"  group 
(Factor  5,  Life ,  Look,  Saturday  Evening;  Post),  a  men's  fiction  group  (Factor  9, 
Argosy  and  True)  and  Factor  4,  a  group  of  fam  lacgar.ines.   Thia  result  iiaplles 
that  Reader's  Dirges t  has  an  appeal  that  spreads  broadly  through  readers  of  at 
least  four  magazine  types.   Re?:dor '  b   Djg.ejt  alao  has  small  negative  loadings 
on  the  sports  car  factor  (Factor  1).   This  finding  shovjs  that  Reader's  Digest's 
broad  appcal-is  not  unlimited  --  that  in  fact  men  who  are  heavy  readers  of  the 
ir.cgaisina  type  ropronentcd  by  He  tor  Trend  tend  not  to  read  Raadrir'a  Dip.est. 

Comnmnal  ity ,   The  colur.ai  to  the  far  right  of  the  matrix,  after  Factor  10, 
shows  the  degree  to  which  the  factors  account  for  or  "explain"  each  of  the 
variables.   Thus  the  factors  extracted  in  thic  analysis  account  for  the  reported 
readership  of  Life  sorr.evhcit  better  than  thoy  account  for  the  reported  readership 
of  Ruflincss  Uaek,  and  they  account  for  Otitdoor  Life  better  than  they  account  for 
Pro?^rcssive  Ffirrr.er  or  Read er ' n   Dlp;ef; t .   The  si2e  of  the  conniunality  is  a  useful 
index  to  hou  much  of  the  variable  is  in  a  sense  "left  over"  after  what  it  has 
in  ccnunon  with  other  variables  has  been  accounted  for.   Tlie  comparatively  lov 
coramunality  of  Reader 'f.  Dl.t^cnt.  for  example,  showa  that  it  has  relatively  little 
In  coxmon  with  the  other  magazines  included  in  this  analysis,  while  the  rela- 
tively high  communalities  of  Field  and  Stream  and  Farm  Journal  shows  that  they  have 
much  in  coirmon  with  the  mogazlna  groups,  t^'.ken  as  p-.roupg ,  that  the  factors 
represent. 
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Factor  Scorer..   Most  factor  analysis  computer  programs  compute  factor 
scores  for  each  respondent  on  each  factor.   The  factor  score  represents  the 
degree  to  which  each  rer^pondeut  gets  high  scores  on  the  t;roup  of  iccms  that 
load  high  on  each  factor.   For  instance,  l^t  us  assume  that  readership  of  the 
magazines  in  Table  6  was  reported  by  each  respondent  on  a  six-category  scale 
ranging  from  "never  read"  to  "read  four  issues  out  of  the  last  four."  The  com- 
puter vjould  determine  each  respondent's  score  on  Factor  1  by  averaging  his  re- 
ported reading  of  each  magazine,  weighted  by  tlie  magazine's  loading  on  Factor  1. 
Thus  since  Car  and  Driver,  Road  and  Track,  Motor  Trend  and  I'.ot  Rod  have  higli 
loadings  on  Factor  1,  each  respondent's  factor  score  on  Factor  1  would  be  deter- 
mined largely  by  his  reading  of  these  magaxincs.   If  he  v;ore  a  heavy  reader  of 
these  magazines,  he  would  have  a  high  score  on  this  factor;  if  he  were  a  light 
(or  non-)  reader  of  these  magazines,  his  score  on  Factor  1  would  be  low. 
Similarly,  since  Ti  r:o. ,  Forbes  and  Forti:ne  have  high  loadings  on  Factor  2,  each 
respondent's  factor  score  on  Factor  2  would  be  determined  largely  by  his  reading 
of  this  set  of  magazines,  so  avid  Tir:-G ,  Forbes  and  Fortunp.  readers  would  get 
high  Factor  2  scores.   Light  or  non-readers  of  these  magazines  would  get  low 
Factor  2  scores,  and  so  on. 

Factor  scores  have  a  variety  of  uses.   Because  they  can  be  cross-tabulated 
with  other  variables,  they  can  help  explain  what  the  factors  mean.   For  instance, 
if  the  interpretation  of  Factors  1  and  2  above  were  not  already  obvious,  some 
insight  into  their  meaning  could  be  obtained  by  cross-tabulating  respondents' 
factor  scores  on  these  factors  with  other  variables,  like  age  and  occupation. 
This  procedure  would  shov;  that  respondents  v;ho  score  high  on  Factor  1  ore  sig- 
nificantly younger  than  respondents  who  score  high  on  Factor  2,  and  that  respondents 
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vho  score  high  on  Factor  2  have  professional  and  managerial  occvipations ,  while 
respondents  v;ho  score  high  on  Factor  1  tend  to  be  blue  collar  workers. 

This  aid  to  interpretation  vould  of  course  have  been  unnecessary  in  the 
magazine  example,  but  it  can  sometimes  be  a  great  help.   In  the  cos;metics  study 
referred  to  earlier,  for  example,  the  underlying  difference  betv;een  two  grou'ps 
of  cosmetics  became  obvious  v;)icn  it  v;as  found  that  one  group  was  used  primarily 
by  younger  women  while  the  otlier  group  v;3s  used  by  women  past  tl\eir  prima.   In 
another  study  the  distinction  betx^een  tv;o  groups  of  grocery  items  vjas  clarified 
wlien  it  was   found  that  one  group  v;as  heavily  used  by  middle  income  families  v;hile 
the  other  group  was  heavily  used  by  low  income  families.   These  relationships 
were  obvious  once  the  factor  scores  had  pointed  them  out,  but  without  the  factor 

scores  to  serve  as  a  guide  through  the  tangle  of  correlations  betx^-een  products  and 
demographic  variables,  it  was  hard  to  know  where  to  look. 

Since  factor  scores  represent  combinations  of  items,  they  can  facilitate 
comparisons  among  groups  of  items  as  groups.   For  example,  the  availability  of 
factor  scores  makes  it  possible  to  say  what  types  of  TV  programs  are  viev7ed  by 
readers  of  what  types  of  m-agaxines ,  or  what  types  of  recreational  interests  go 
with  what  types  of  taste  in  clothes. 

Finally,  factor  scores  ca_n  be  treated  as  if  they  were  raw  scores  to 
perform  any  of  a  number  of  multivariate  analyses.   These  include  m.ultiple  re- 
gression, multiple  discriminant  analysis  and  clustering. 

Comments  on  Q  Analysis 

The  analysis  just  described  is  R-type  factor  analysis,  by  far  the  most 
common.   In  R  analysis  high  correlations  occur  when  respondents  wlio  score  high 
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on  variable  1  also  score  high  on  variable  2,  and  respondents  who  score  low  on 
variable  1  also  ccore  low  on  variable  2.   Factors  emerge  when  there  are  (re- 
latively) high  correlations  within  groups  of  variables. 

In  Q  analysis,  the  correlations  are  coiTiputcd  between  pairs  of  respondents 
instead  of  pairs  of  variables.   High  correlations  occur  when  respondent  I's 
pattern  of  responses  on  all  the  variables  is  much  like  respondent  2's  pattern 
of  responses.   Factors  emerge  when  there  are  relatively  high  correlations  v,-ichin 
groups  of  people. 

Q  analysis  is  useful  when  tiie  object  is  to  sort  people  into  groups  based 
on  their  simultaneous  responses  to  all  tlie  Vc'.riables ,   It  is  th.crefore  beinj 
increasingly  employed  in  market  segmentation  studies. 

Two  difficulties  with  Q  analysis  have  limited  its  use  to  date.   CurrcnLly 
available  Q  analysis  con-.putcr  programs  do  not  handle  even  moderately  large  sai^ples 
of  respondents  easily,  and  reliability  tests  have  suggested  that  Q  factors  ar... 
disappointingly  unstable  from  sample  to  sample.   It  seems  reasonable  to  expect 
that  these  problems  vjill  be  overcome  or  at  least  better  understood  as  time  goes 
by,  and  that  Q  analysis  or  one  of  its  mathematical  relatives  will  become  a 
standard  and  important  market  research  technique.   For  more  on  Q  analysis  see 
Stephenson  (1953)  and  Schlinger  (1969). 

Difficulties,  Problems  cv.d   Cautions 

Cost.   A  factor  analysis  of  even  moderate  size  requires  a  prodigious 
amount  of  number  cruncliing.   Before  the  advent  of  computers,  factor  analytic 

studies  employing  fifty  or  m.ore  variables  v;ere  alm.ost  never  attempted,  and  even 
much  sn.allcr  studios  required  so  m:any  hours  of  labor  on  a  hand  calculator  that 

they  vara   seldom  replicated,  checked  for  reliability  or  even  examined  for  arith- 
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metic  errors.   Perhaps  it  was  the  heroic  amount  of  effort  required  that  led  some. 
of  the  early  analysts  to  believe  that  their  work  had  revealed  the  Truth. 

Although  coitiputerized  factor  analysis  is  now  much  faster  and  much  easier, 
it  is  still  not  costless.   The  cost  of  an  R  analysis  increases  roughly  linearly 
with  the  number  of  respondents,  and  it  increases  much  faster  than  linearly  v.'ith 
the  number  of  variables.   The  cost  of  a  Q  analysis  is  extremely  sensitive  to  the 
number  of  respondents.  '  Large  analyses  arc  liable  to  be  expensive,  and  adding 
variables  to  an  already  large  analysis  is  apt  to  increase  the  cost  far  faster 
than  it  increases  value. 

Reliabil ity.   Like  any  other  statistical  procedure,  a  factor  analysis 
starts  v.'ith  a  set  of  imperfect  data.   VJhen  the  data  change  because  of  changes  in 
the  sample,  clianges  in  data  [gathering  procedures  or  any  of  the  numerous  Icinds  of 
measurement  errors,  the  results  of  the  analysis  will  change  too.   The  results  of 
any  sinj^le  analysis  are  therefore  always  less  than  perfectly  dependable. 

This  problem  is  especially  pernicious  because  the  results  of  a  single 
factor  analysis  usually  look  plausible.   In  fact  they  sometimes  look  so  plausible 
that  the  analyst  is  tempted  to  say  to  himself,  "What's  interesting  about  tliis? 
I  knew  it  all  the  time.'"   (He  didn't). 

It  is  important  to  emphasize  that  plausibility  is  no  guarantee  of  validity 
or  even  stability.   A  factor  analysis  of  randomly  generated  data  will  seem  to 
make  some  sense  when  stared  at  long  enough  and  hopefully  enough^   (Armstrong  and 
Soelberg,  1968.)  A  factor  solution  computed  from  one  randomly  selected  half  of 
the  respondents  may  seem  to  be  an  obvious  representation  of  reality  until  it  is 
placed  beside  a  somewhat  different  but  equally  plausible  solution  computed  from 
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the  rem-^ining  respondents.   The  moral  is:   Do  the  analysis  at  least  tv;ice.   At 
a  minimum,  divide  the  respondents  into  tvjo  groups  at  random  by  means  of  a  rando'i 
number  table  and  clieck  one  group  against  tlie  other.   UTien  different  samples  pro- 
duce similar  results  confidence  increases. 

The  sampling  problem  extends  to  the  set  of  variables  used  in  the  analysis. 
It  should  be  obvious  that  a  diir.cnsion  cannot  cmcrs^e  from  a  factor  analysis  unless 
at  least  t'.vO  variables  arc  in  the  analysis  to  represent  it.   It  is  perhaps  less 
obvious  that  putting  variables  in  and  taking  them  out  vill  influence  the  patterns 
formed  by  other  variables.   If  some  variables  are  added  that  have  a  strong  rela- 
tionship v;ith  so;T.e  variables  that  would  otherx.'ise  h.ave  formed  a  factor,  the 
factor  may  break  into  tv.'o  parts,  one  with  high  loadin[^>,s  on  the  new  and  some  of 
tiie  "old"  variables,  the  other  v.'ith  high  loadings  on  only  the  old. 

Judgment .   It  lias  been  said  that  sending  data  out  to  be  factor  analyzed 
is  much  like  sending  suits  to  tlie  cleaners--you  don't  Iiave  to  knov?  anything  about 
what  v;as  done  to  the  suits  or  the  data  as  long  as  they  come  back  clean  and  free 
from  v;rinkles.   It  should  be  clear  by  now  tb.at  the  problem  is  not  that  simple. 
The  user  of  factor  analysis  maizes  decisions  that  determine  how  the  analysis  v.'ill 
come  out,  or  else  tlie  decisions  arc  made  for  liim.   Even  with  a  given  set  of  de- 
cisions, different  findings  vjill  come  from  different  groups  of  respondents, 
different  ways  of  obtaining  data  and  different  mixes  of  variables. 

Al-1  this  is  highly  disconcerting  to  anyone  v/no  needs  to  believe  that  the 
results  of  any  one  factor  analysis  will  be  }(evealed  Universal  Truth  Forevcr 
Enduring,  and  it  has  sometimes  led  to  disappointment  and  even  indignant  rejection 
of  the  method y   (Ehrenberg,  1968.) 
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If  on  the  other  hand,  f.nctor  analysis  is  regarded  as  one  of  several 
fairly  complicated  tools  that  may  help  untangle  badly  tangled  data,  the  user 
is  much  less  liable  to  feel  cheated  when  he  tries  to  line  up  the  results  of  a 
factor  analysis  against  the  real  v;orld.   VJhen  it  \.;orks  well,  factor  analysis 
helps  the  investigator  make  sense  of  large  bodies  of  intertwined  data.   When 
it  works  unusually  well,  it  also  points  out  some  interesting  relationships  that 
might  not  liave  been  obvious  from  examination  of  the  input  data  alone.   And 
that's  what  makes  all  the  work  v.'orChv.'hile . 
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